Autonomic Computing: Learning to Repair Systems Effectively
نویسنده
چکیده
This research aims to integrate temporal difference learning methods into an autonomic learning system. The current Rainbow architecture models an adaptation engine that examines the system state to assess problems and determines a best course of action based on pre-programmed expertise. However, when expert knowledge is not readily available for a system, learning proper actions without a priori knowledge would often be more useful in practical situations. This report characterizes initial testing of a new learning engine designed to learn the proper corrective actions to take in order to repair problems in systems. Specifically, reinforcement learning methods can enable a system to learn the proper actions by actually trying out actions and seeing how well they do. This project compares Q-learning and SARSA temporal difference learning algorithms with the existing expert algorithm on the Carnegie-Mellon designed Rainbow simulated system. Since they approach reinforcement learning in a different way, comparing the success rate and speed of the algorithms on various scenarios will provide evidence about their efficacy. My research aims to clarify the benefits and costs associated with these algorithms.
منابع مشابه
Autonomic Computing Features for Large-scale Server Management and Control
A computer system would satisfy the requirements of “autonomic computing”, if the system can configure and reconfigure itself by knowing the operating environments, protect and heal itself from various failures or malfunctions. In order to know the environments and detect failure, an autonomic system needs the capability of acquiring the information through self-monitoring.Once the sequence of ...
متن کاملTowards Autonomic Computing: Adaptive Job Routing and Scheduling
Computer systems are rapidly becoming so complex that maintaining them with human support staffs will be prohibitively expensive and inefficient. In response, visionaries have begun proposing that computer systems be imbued with the ability to configure themselves, diagnose failures, and ultimately repair themselves in response to these failures. However, despite convincing arguments that such ...
متن کاملDisTriB: Distributed Trust Management Model Based on Gossip Learning and Bayesian Networks in Collaborative Computing Systems
The interactions among peers in Peer-to-Peer systems as a distributed collaborative system are based on asynchronous and unreliable communications. Trust is an essential and facilitating component in these interactions specially in such uncertain environments. Various attacks are possible due to large-scale nature and openness of these systems that affects the trust. Peers has not enough inform...
متن کاملAn Autonomic Service Oriented Architecture in Computational Engineering Framework
Service Oriented Architecture (SOA) technology enables composition of large and complex computational units out of the available atomic services. Implementation of SOA brings about challenges which include service discovery, service interaction, service composition, robustness, quality of service, security, etc. These challenges are mainly due to the dynamic nature of SOA. SOAmay often need to ...
متن کاملAn Autonomic Service Oriented Architecture in Computational Engineering Framework
Service Oriented Architecture (SOA) technology enables composition of large and complex computational units out of the available atomic services. Implementation of SOA brings about challenges which include service discovery, service interaction, service composition, robustness, quality of service, security, etc. These challenges are mainly due to the dynamic nature of SOA. SOAmay often need to ...
متن کامل